AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Step-by-step optimization strategy

# Step-by-step optimization strategy

R1 VL 7B
Apache-2.0
R1-VL-7B is an inference model based on Qwen2-VL-7B-Instruct, trained using the Stepwise Grouped Relative Policy Optimization (StepGRPO) method, focusing on the image-text to text task.
Image-to-Text Transformers
R
jingyiZ00
1,729
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase